Voice Simulation: Factors Affecting Quality And Naturalness
نویسندگان
چکیده
In this paper we describe a f lexib le analysls-synthesls system which can be used for a number of studies In speech research. The maln objective Is to have a synthesis system whose characteristics can be controlled through a set of parameters to realize any desired voice characteristics. The basic synthesis scheme consists of two steps: Generation of an excitation signal from pitch and galn contours and excitation of the linear system model described by linear prediction coefficients, We show that a number of basic studies such as time expansion/ compression, pitch modif icat ions and spectral expansion/compression can be made to study the e f fec t of these parameters on the qua l i ty of synthetic speech. A systematic study is made to determine factors responsible for unnaturalness tn synthetic speech. I t i s found that the shape of the g lo t ta l pulse determines the qua l i ty to a large extent. We have also made some studies to determine factors responsible for loss of I n t e l l i g i b i l i t y tn some segments of speech. A signal dependent analysts-synthesis scheme ts proposed to improve the i n t e l l i g i b i l i t y of dynamic sounds such as stops. A simple implementation of the signal dependent analysis is proposed.
منابع مشابه
Voice Analysis in English and Persian Persuasive Texts: Pedagogical implications in focus
The main purpose of this study is to investigate how voice is realized by Iranian EFL learners in persuasive English and Persian text types. This discourse-related notion is a required criterion for writing acceptable English. However, L2 learners from cultures other than English might face problems in realizing it, or even ignore it all through their writing. In this connection, the present st...
متن کاملVoice Analysis in English and Persian Persuasive Texts: Pedagogical implications in focus
The main purpose of this study is to investigate how voice is realized by Iranian EFL learners in persuasive English and Persian text types. This discourse-related notion is a required criterion for writing acceptable English. However, L2 learners from cultures other than English might face problems in realizing it, or even ignore it all through their writing. In this connection, the present st...
متن کاملOn-line experimental methods to evaluate text-to-speech (TTS) synthesis: effects of voice gender and signal quality on intelligibility, naturalness and preference
Three experiments are reported that use new experimental methods for the evaluation of text-to-speech (TTS) synthesis from the user’s perspective. Experiment 1, using sentence stimuli, and Experiment 2, using discrete ‘‘call centre’’ word stimuli, investigated the effect of voice gender and signal quality on the intelligibility of three concatenative TTS synthesis systems. Accuracy and search t...
متن کاملImprovement of prosodic characteristic in Vietnamese speech synthesis system base on HMM
The key factors helping people to understand the synthesized voices of text-to-speech system are the naturalness and the intelligibility. However, making more natural voices remains a difficult task because of the speech data’s scarcity. With data limited corpus, prosodic information such as tone, intonation, Part-of-Speech is added to ensure the quality of synthetic speech. In the paper, we in...
متن کاملVocal Disorders and Risk Factors Affecting It: Voice Ergonomics in Teachers
Introduction: Nearly a third of people work in jobs that use voice to be part of their work. Teachers as the largest group of professional vocal users, are at risk of vocal disorders. The aim of this study was to investigate the effect of different risk factors on vocal disorders in teachers. Material and Methods: This is a cross-sectional and descriptive-analytic study that was conducted on...
متن کاملFactors affecting perceived quality and intelligibility in the CHATR concatenative speech synthesiser
In order to eliminate trial-and-error in the process of selecting a good speech database as a voice source for concatenative speech synthesis, and to determine the acoustic and prosodic characteristics that best predict `appeal' or perceived `quality' in the synthesised speech, we performed tests to evaluate listener preferences over a range of di erent synthesised voices. We found that variati...
متن کامل